Coherence-based Dereverberation for Automatic Speech Recognition

Authors

  • Andreas Schwarz
  • Andreas Brendel
  • Walter Kellermann
Abstract

The idea of performing dereverberation using a short-time spatial coherence estimate dates back to 1977 [1], when it was proposed to use essentially the magnitude of the coherence as a gain for reverberation suppression. Another heuristic method was recently proposed in [2], where a soft threshold function is used to compute a gain from the coherence magnitude, and the parameters of the threshold function are adapted depending on the histogram of the coherence magnitude in each frequency bin. Short-time coherence estimates have also been investigated in the context of beamforming as a so-called postfilter, and solutions for suppression of uncorrelated and diffuse noise have been proposed [3]. In this contribution, we focus on methods where, first, the ratio between direct and reverberant signal components (coherent-to-diffuse ratio, CDR) is estimated from a short-time coherence estimate, and, second, filter weights for reverberation suppression are computed from the CDR using, e.g., the Wiener filter or spectral subtraction rule. We compare and illustrate the behavior of a number of different CDR estimators that have been proposed over the past years, and propose a new variant. Finally, we compare the practical effect of the methods by processing reverberated speech and evaluating the recognition accuracy achieved by an automatic speech recognizer with the processed signals.
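
The two-step pipeline described in the abstract (coherence estimate, then CDR, then suppression gain) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the estimator proposed in the paper: it assumes a two-microphone setup, a known time difference of arrival for the direct path, an ideal spherically diffuse reverberation field, and the standard mixing model in which the observed coherence is a CDR-weighted average of the direct and diffuse coherences. The CDR is obtained by directly inverting that model. All function names, the smoothing factor `alpha`, and the gain floor are hypothetical choices made for the example.

```python
import numpy as np


def short_time_coherence(X1, X2, alpha=0.68, eps=1e-12):
    """Recursively averaged complex coherence of two STFTs, shape (frames, bins).

    alpha is an assumed smoothing factor, not a value taken from the paper.
    """
    n_frames, n_bins = X1.shape
    phi11 = np.zeros(n_bins)
    phi22 = np.zeros(n_bins)
    phi12 = np.zeros(n_bins, dtype=complex)
    coh = np.zeros((n_frames, n_bins), dtype=complex)
    for t in range(n_frames):
        # First-order recursive averaging of auto- and cross-power spectra.
        phi11 = alpha * phi11 + (1 - alpha) * np.abs(X1[t]) ** 2
        phi22 = alpha * phi22 + (1 - alpha) * np.abs(X2[t]) ** 2
        phi12 = alpha * phi12 + (1 - alpha) * X1[t] * np.conj(X2[t])
        coh[t] = phi12 / np.sqrt(phi11 * phi22 + eps)
    return coh


def diffuse_coherence(freqs_hz, mic_distance_m, c=343.0):
    """Coherence of an ideal spherically diffuse field: sin(x)/x, x = 2*pi*f*d/c."""
    # np.sinc(x) is sin(pi*x)/(pi*x), so the factor pi is left out of the argument.
    return np.sinc(2.0 * freqs_hz * mic_distance_m / c)


def direct_coherence(freqs_hz, tdoa_s):
    """Coherence of a single plane wave with a known time difference of arrival."""
    return np.exp(1j * 2.0 * np.pi * freqs_hz * tdoa_s)


def mix_coherence(cdr, coh_direct, coh_diffuse):
    """Assumed mixing model: coherence of direct plus diffuse components."""
    return (cdr * coh_direct + coh_diffuse) / (cdr + 1.0)


def estimate_cdr(coh_x, coh_direct, coh_diffuse, eps=1e-10):
    """Invert the mixing model; keep the real part and clip negative values."""
    denom = coh_x - coh_direct
    denom = np.where(np.abs(denom) < eps, eps, denom)  # avoid division by zero
    cdr = np.real((coh_diffuse - coh_x) / denom)
    return np.maximum(cdr, 0.0)


def wiener_gain(cdr, gain_floor=0.1):
    """Wiener-type gain CDR / (CDR + 1), limited from below by an assumed floor."""
    return np.maximum(cdr / (cdr + 1.0), gain_floor)


# Synthetic check: build the coherence from a known CDR and recover it.
freqs = np.array([500.0, 1000.0, 2000.0])
coh_n = diffuse_coherence(freqs, mic_distance_m=0.08)
coh_s = direct_coherence(freqs, tdoa_s=1e-4)
true_cdr = np.array([0.5, 2.0, 10.0])
coh_x = mix_coherence(true_cdr, coh_s, coh_n)
cdr_hat = estimate_cdr(coh_x, coh_s, coh_n)
gains = wiener_gain(cdr_hat)
```

In practice `coh_x` would come from `short_time_coherence` applied to the two microphone STFTs, and the resulting gains would be applied per frame and frequency bin to one of the channels. With estimated (noisy) coherence, the direct inversion generally produces complex values, which is why only the real part is kept and clipped here; other estimators handle this differently.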

Similar resources

Optimizing Wavelet Parameters for Dereverberation in Automatic Speech Recognition

We present an optimization method of the wavelet parameters for dereverberation in automatic speech recognition (ASR). By tuning the wavelet parameters to improve the acoustic model likelihood, wavelet-based dereverberation methods become more effective in the ASR application. We evaluate several existing wavelet-based methods and optimize them, based on our proposed scheme. Experimental evalua...

A Simplified Decoding Method for a Robust Distant-talking ASR Concept Based on Feature-domain Dereverberation

A simplified decoding method for the concept of REverberation MOdeling for Speech recognition (REMOS) [1] is proposed. In order to achieve robust distant-talking Automatic Speech Recognition (ASR), the REMOS concept uses a combination of clean-speech HMMs and a reverberation model to perform feature-domain dereverberation during decoding. The simplified decoding/dereverberation method proposed ...

An improved wavelet-based dereverberation for robust automatic speech recognition

This paper presents an improved wavelet-based dereverberation method for automatic speech recognition (ASR). Dereverberation is based on filtering reverberant wavelet coefficients with the Wiener gains to suppress the effect of the late reflections. Optimization of the wavelet parameters using acoustic model enables the system to estimate the clean speech and late reflections effectively. This ...

Dereverberation with an Iterative Least-Squares Technique and Minimum Mean-Square Error Estimation for Automatic Speech Recognition

This work is about dereverberation for automatic speech recognition. The use of a linear minimum mean-square error estimator for enhancing a recently proposed dereverberation method is investigated. The conducted phoneme recognition experiments show that the resynthesis step, which was done in the original work of the dereverberation method, can be omitted. Furthermore, it is shown that the rec...

Improving automatic speech recognition performance and speech inteligibility with harmonicity based dereverberation

A speech signal captured by a distant microphone is generally smeared by reverberation, which severely degrades both the speech intelligibility and Automatic Speech Recognition (ASR) performance. Previously, we proposed a novel dereverberation method, named “Harmonicity based dEReverBeration (HERB)”, which estimates the inverse filter of an unknown impulse response by utilizing the inherent spee...

Publication date: 2014